ON  THE  EULER  FUNCTION  OF  THE  CATALAN 

NUMBERS 
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Abstract.  We  study  the  solutions  of  the  equation  / <t>{Cn)  =  r, 

where  r  is  a  fixed  rational  number,  Ck  is  the  fcth  Catalan  number  and  <j> 
is  the  Euler  function.  We  note  that  the  number  r  =  4  is  special  for  this 
problem  and  for  it  we  construct  solutions  (m,  n )  to  the  above  equation 
which  are  related  to  primes  p  such  that  2p  —  1  or  4p  —  3  is  also  prime. 


1.  An  observation  concerning  (f(Cn+i)  /  4>{Cn) 


For  a  positive  integer  n.  let 

(1)  Cn 


n  +  1 


2  n 
n 


be  the  n-th  Catalan  number.  For  a  positive  integer  m  we  put  for  the 

Euler  function  of  m. 

A  Carmichael’s  conjecture  [5],  which  is  still  open,  states  that  for  every  n 
it  is  possible  to  find  an  m  ^  n  such  that  <f>(m)  =  <j>(n).  Since  this  problem 
seems  to  be  currently  out  of  reach,  one  would  look  at  the  behavior  of  the 
Euler’s  phi  function  </>(•),  or  at  quotients  </>(•)/(/>(•),  when  the  arguments 
belong  to  some  smaller  classes  of  integers,  like  the  binomial  coefficients, 
binary  recurrent  sequences,  or  even  Catalan  numbers.  In  fact,  there  is  a 
growing  literature  on  arithmetic  functions  with  binomial  coefficients  [12], 
[15]  and  [17],  or  on  arithmetic  functions  with  members  of  binary  recurrent 
sequences  [3,  11,  13,  14,  16,  18,  19,  20,  21],  etc. 

At  first,  we  wanted  to  test  whether  we  could  find  distinct  m  and  n  such 
that  <f>(Cm )  =  4>(Cn)  but  did  not  find  solutions  other  than  the  trivial  solution 
4>(Ci)  =  (j)[C2)  =  1.  So,  we  checked  numerically  for  the  values  of  the  ratios 
(j)(Cm) / (j>(Cn)  for  m  /  n.  While  computing  such  ratios  for  small  values  of 
m  and  n,  we  first  noted,  then  we  proved,  the  following  result. 


Theorem  1.1.  The  equality 

(2)  <t>(Cn+i)  =  4  <t>(Cn) 

holds  in  each  of  the  following  two  instances: 
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(i)  n  =  2p  —  2,  where  p  >  5  is  a  prime  such  that  q  =  4p  —  3  is  also  a 
prime. 

(ii)  n  =  3p  —  2,  where  p  >  5  is  a  prime  such  that  q  =  2p  —  1  is  also 
prime. 

Proof.  We  have 

(3) 

For  (i),  we  use  (3)  with  n 
getting 

(4)  pCn+ 1  =  qCn. 

Hence, 

(5)  Cn+ 1  =  qC  and  Cn  =  pC 

for  some  positive  integer  C.  Since  q  =  2n  +  1  >  2 n,  it  follows  that  q  does 
not  divide  Cn,  so  in  particular  q  does  not  divide  C.  Since  p  =  (n  +  2)/2,  it 
follows  that  p3 ||(2n)!  and  p2||n!(n  +  1)!,  so  p\\Cn.  Here  and  in  what  follows, 
for  a  prime  p  and  positive  integers  a  and  m  we  write  pa\\m  when  pa  \  m 
but  pa+l  |  rn.  It  follows  that  p  \  C.  Thus,  gcd {pq,C)  =  1.  Applying  the 
Euler  function  to  the  equalities  in  (5),  and  taking  the  ratio  of  the  resulting 
relations  we  get 

<t>(Cn+ 1)  =  (g-  1 Mg)  =  4 
<j>(cn)  (p-mc)  ’ 

because  q  —  1  =  2n  =  4 (p  —  1),  which  is  what  we  wanted.  The  argument  for 
(ii)  is  similar.  Namely,  in  this  case  n  +  2  =  3p  and  2n  +  1  =  6p  —  3  =  3 q,  so 
that  instead  of  relation  (4)  we  get 

(7)  pCn+ 1  =  2  qCn. 

Hence, 

(8)  Cn+ 1  =  2  qC  and  Cn  =  pC 

for  some  positive  integer  C.  Let  us  first  see  that  C  is  even.  If  C  is  odd,  then 
Cn  is  odd,  therefore  n  =  2“  —  1  for  some  positive  integer  a  (see  [1]).  Thus, 
p  =  (n  +  2)/3  =  (2a  +  l)/3,  is  an  integer,  so  a  is  odd.  Further,  since  p  >  5, 
it  follows  that  a  >  5.  Next, 

q  =  2p  -  1  =  - — 1 1  =  i(2(a+1)/2  +  l)(2(a+1)/2  -  1). 

Since  a  >  5,  it  follows  that  both  numbers  2^a+1^2  +  1  and  2^a+1^2  —  1  are 
larger  than  3,  and  in  particular,  q  =  (2a+1  —  l)/3  cannot  be  prime,  which  is 
a  contradiction.  This  shows  that  C  is  even.  Since  q  =  (2 n  +  l)/3,  it  follows 
that  f/2||(2n)!  and  g2||n!(n  +  1)!,  so  q\Cn.  Thus,  q  \  C.  Since  p  =  (n  +  2)/3, 
it  follows  that  p5||(2n)!  and  p4||n!(n  +  1)!,  so  p\\Cn.  In  particular,  p  \  C. 


_  2(2n  +  l) 
n+1  n  +  2 

=  2p  —  2  where  both  p  and  q  =  4p  —  3  are  primes, 
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Thus,  we  have  that  gcd {pq,C)  =  1.  Taking  the  Euler  function  in  relations 
(8)  and  dividing  the  resulting  expressions  we  get 

^{Cn+ll  =  2{q-mCl  ^  , 

4>(Cn)  (p-l)(j){C) 

because  (</  —  !)  =  2 (n  —  l)/3  =  2 (p  —  1),  which  is  what  we  wanted.  □ 


2.  The  Main  result 


As  we  shall  see  later,  there  are  many  other  solutions  to  (2)  and  we  shall 
explain  how  to  find  some  of  them.  We  do  not  know  if  there  are  infinitely 
many  primes  p  such  that  4p  —  3  is  a  prime,  or  2p  —  1  is  a  prime.  It  follows  by 
the  Hardy  and  Littlewood  conjectures  (see  [9])  that  for  large  x  the  number  of 
such  primes  should  be  asymptotically  cq  x/ (log  x )2  for  some  positive  constant 
Co-  We  asked  ourselves  whether  it  is  likely  for  some  positive  integer  n  to 
exist  another  positive  integer  m  such  that  (j)(Cn)  / (j)(Cm)  is  a  fixed  rational 
number  r.  In  the  above,  we  allow  r  =  1,  but  in  this  case  we  impose  that 
m  n.  More  precisely,  for  a  fixed  r  €  Q+,  define  the  following  set 


(9) 


Mr 


(  .  mA 

l  '  <t>{Cn) 


r  holds  for  some  m 


For  a  large  real  number  x  we  put  Mr{x)  =  MrC\  [1,  x\.  Computer  experiments 
turned  up  lots  of  solutions  for  r  =  4  and  the  symmetrical  r  =  1/4,  but  very 
few  solutions  for  other  values  of  r.  We  asked  ourselves  if  r  =  4  and  r  =  1/4 
are  special  in  this  respect.  Our  main  result  below  together  with  the  above 
Hardy  and  Littlewood  conjectures  seem  to  indicate  that  this  is  indeed  the 
case. 

In  what  follows,  we  use  the  Landau  symbols  O  and  o  as  well  as  the 
Vinogradov  symbols  <C,  3>  and  x  and  ~  with  their  usual  meanings.  The 
constant  and  speed  of  convergence  implied  by  them  might  depend  on  our 
parameter  r.  Recall  that  A  =  O(B),  and  B  3>  A  are  all  equivalent 

and  mean  that  the  inequality  |A|  <  cB  holds  with  some  positive  constant 
c.  Further,  A  x  B  means  that  both  A  «  B  and  B  <C  A  hold,  A  =  o(B) 
means  that  A/B  tends  to  zero,  whereas  A  ~  B  means  that  A/B  tends 
to  1.  We  use  Co,  ci,  ...  for  positive  constants  which  might  depend  on  our 
parameter  r.  We  write  P[m )  and  p(m)  for  the  largest  and  smallest  prime 
factor  of  the  positive  integer  m,  respectively.  We  write  p.  q  and  p  with  or 
without  subscripts  for  prime  numbers.  For  a  positive  real  number  x  we  write 
log  x  for  the  natural  logarithm  of  x. 


Theorem  2.1.  The  estimate 

X 

(10)  #Mr(x)  <  — - ‘  holds  for  r  014,1/4}  as  x  — >  oo. 

(logx)J+0i1^ 

However, 

(11)  #Mr(x)  < 


(logx)2 


holds  when  r  =  4,  1/4  for  all  x  >  10. 
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3.  The  proof  of  Theorem  2.1 

Since  r  is  fixed,  we  write  only  A f(x)  and  omit  the  dependence  on  r.  We 
let  x  be  large  and  let  Ad(x)  =  M  n  {x/2,x\.  It  is  enough  to  prove  that 
the  upper  bounds  (10)  and  (11)  hold  on  #Ad(x),  since  afterwards  the  same 
upper  bounds  on  #AT(x)  will  follow  by  replacing  x  with  x/2,  then  with  x/4, 
and  so  on. 


3.1.  An  upper  bound  for  \m  —  n\.  We  use  the  asymptotic 

22n 

(12)  C'n~ci^7r  as  n-Aoo, 

nai* 

where  ci  =  1  /y/n  (see  Exercise  9.8  in  [6]).  We  also  use  the  fact  that  the 
bounds 


(13) 


<  <!>{£)  <  i 


log  log  l 

hold  for  all  positive  integers  t  >  3  (see  Theorem  328  in  [10]).  Using  esti¬ 
mate  (13)  with  i  =  Cm  and  t  =  Cn,  we  get  that 


)2m 


)2m 


~)2  n 


)2n 


(14) 


rrv 


3/2  ^g 


m 


<  < 


m 


3/2 


and 


3/2  i0g 


n 


n 


<  <p(Cn)  < 


n 


3/2  ' 


Assume  now  that  n  G  M(x)  and  that  rn  ^  n  is  such  that  r  =  < f>(Cm) / cf>(Cn) . 
Taking  logarithms  and  using  estimates  (14),  we  get 


log r|  =  | log  (<j>(Cm) / <f>(Cn))\  =  2\m  —  n\  log 2  +  0(log(m  +  n)). 


The  above  estimate  shows  that  rn  =  n  +  (9 (log  x).  We  return  to  (14)  and 
observe  that  in  fact  it  yields 

log  4>(Cm)  =  2?nlog2  —  (3/2)  logm  +  O(loglogm), 
log  (j>{Cn)  =  2n log  2  —  (3/2)  log n  +  0(log log n). 


Applying  estimate  (15)  with  n  and  m  and  taking  the  difference  of  the  re¬ 
sulting  relations,  we  get  that 


log  4>(Cm)  ~  log  4>(Cn)  =  2{m  —  n)  log  2  -  (3/2)  log(m/n)  +  0( log  log  x) 

=  2{m  —  n)  log  2  —  (3/2)  log  ^1  +  O 
+  O  (log  log  x) 

=  2(m  —  n)  log  2  +  O(loglogx). 

We  thus  get  that 

I  log 7*|  =  [log  (<j)(Cm) / 4>{Cn))\  =  2\m  —  n\  log  2  +  0(log  log x), 

which  implies  that  m  =  n  +  O(loglogx). 

Let  C2  be  the  constant  implied  by  the  previous  O-symbol.  We  also  let 
K  =  C2  log  log  x.  Thus,  m  =  n  +  k,  where  0  <  |fc|  <  K.  We  write  A4^(x) 
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for  the  set  of  n  £  M  ( x )  for  which  there  exists  m  with  m  =  n  +  k  such  that 
<j>(Cm) /<f>(Cn)  =  r.  Clearly, 

#M{x)<  £ 

0<\k\<K 

It  remains  to  estimate  # M^(x).  We  treat  only  the  case  of  the  positive 
number  k,  since  the  case  when  k  is  negative  can  be  dealt  with  in  a  similar 
way.  We  fix  the  number  k  <  K. 


3.2.  Deducing  the  STMN  equation.  We  have 


(16) 


2k(2n  +  l)(2n  +  3)  •  •  •  (2n  +  2k  —  1) 
(n  +  2)(n  +  3)  •  •  •  (n  +  k  +  1) 


so  that 


(17)  (n  +  2)  •  •  •  (n  +  k  +  1  )Cm  =  2fc(2 n  +  l)(2n  +  3)  •  •  •  (2 n  +  2 k-  1  )Cn. 

Observe  that  if  p  \  n+i  +  1  and  p  \  2n+2j  —  1  holds  for  some  i,j  €  {1, . . . ,  k}, 
then  p  |  2j  —  2i  —  3,  and  this  last  number  is  odd  and  has  absolute  value  at 
most  2 K  +  1.  Thus,  such  primes  p  are  at  most  2 K  +  1.  The  same  is  true 
for  prime  factors  p  common  to  n  +  i  +  1  and  n  +  i\  +  1  for  some  i  ^  i\  in 
{1,2,...,  k},  as  well  as  for  prime  factors  p  common  to  both  2 n  +  2j  —  1  and 
2 n  +  2ji  —  1  for  some  j  /  j\  also  in  the  set  {1,2,...,  k}.  The  above  relation 

(17)  can  be  written  as 

(18)  UMCm  =  VNCn, 

where  U,V  are  coprime  integers  with  P(UV)  <  2 K  +  1,  M  and  N  are 
coprime  integers  with  p(MN)  >  2 1\  +  1,  and 

(19) 

(n  +  2)  •  •  •  (n  +  k  +  1)  =  UMD  and  2k(2n  +  1)  •  •  •  (2n  +  2k  —  1)  =  VND, 
for  some  positive  integer  D  with  P(D )  <  2 K  +  1,  where 

D  =  gcd  ^(n  +  2)  •  •  •  (n  +  k  +  1),  2k{2 n  +  1)  •  •  •  (2 n  +  2 k  —  1)^  . 
Equation  (18)  gives 

(20)  Cm  =  VNC  and  Cn  =  UMC 


for  some  positive  integer  C.  Write  C  =  IVT^/TA/TArT,  where 
r7  =  n  Pap  for  each  of  /  €  {17,  V,  M,  N}, 

Pap\\c 

p\I 

and  T  is  the  largest  divisor  of  C  which  is  coprime  to  UVMN .  We  now  apply 
the  Euler  function  to  the  two  relations  (20)  getting 

HCm)  =  (KVMNFvTNttruWTMMT), 

'  “  !  !  (fr(Cn)  =  (l,{U)<l>{M)YuTM<f>{Tv)(l>{TN)<t>{T). 
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Write  r  =  u/v  with  coprime  positive  integers  u  and  v.  Observe  that  the 
relation  r  =  (j)(Cm) / <j>(Cn) ,  is  the  same  as  u<f>{Cn)  =  v(j)(Cm),  which,  via  the 
relations  (21),  leads  to 

(22)  S</>(M)  Tm  <KTn)  =  T  4>{N)  VN  <j>(VM), 

where  S  =  u^>{U)Yu^>{Ty')  and  T  =  v(j>(V)Yy(j>(Yu)  have  the  property  that 
P(ST)  <  2K  +  1  provided  that  x  is  large  enough,  say  large  enough  such 
that  2 K  +  1  >  max{ji,jj}.  We  refer  to  (22)  as  the  STM N -equation. 

3.3.  Large  and  very  large  primes.  Next  let  C3  be  some  absolute  constant 
to  be  determined  later  and  put  y  =  (log  x) 10  and  z  =  x1^03  loglogI).  We  also 
put  J  =  (y,z\.  We  say  that  a  prime  p  is  large  if  p  >  y  and  very  large  if 
p  >  z.  Hence,  primes  in  J  are  large  but  not  very  large. 


3.4.  The  case  when  (n  +  1  )MN  is  divisible  by  the  square  of  a  large 

prime.  Let  A4^(x)  be  the  subset  of  n  £  M^k\x)  for  which  p2  \  (n  +  l)MN 
for  some  large  prime  p.  We  assume  that  x  is  sufficiently  large  such  that 
y  >  2K  +  3.  It  then  follows  that  either  there  exists  %  £  {0, 1,2,...,  fc}  such 
that  p2  |  n  +  *  +  1,  or  j  G  {1,2,...,  fc}  such  that  p2  \  2n  +  2 j  —  1.  Since 

max{n  +  i  +  1,  2n  +  2j  —  1}  <  x  +  2 K  —  1  <  2x, 


for  large  x,  it  follows  that  the  number  of  such  positive  integers  n  <  x  for  a 
fixed  i  (or  j)  is  at  most  2 x/p2.  Varying  i  (or  j)  in  {1, ... ,  k},  it  follows  that 
the  number  of  such  possibilities  is  <  4 (k  +  1  )x/p2.  Summing  this  up  over 
all  the  large  primes  p,  we  get  that 


<  i{K  +  l  r$) 

P>yp  xy  Jy  J 

.x(log  log  x) 

(logx)10 

Putting  M\(x)  =  Uo<|fc|<A:-^difc)(x)>  we  Set  that 


16/lx 

V 


(23) 


#A4i(x)  <  Y.  #M(fc)(x)« 

0<\k\<K 


x(log  logx)2 
(logx)10 


((logx)3)  ’ 


as  x  -A  00.  The  bound  (23)  is  acceptable  for  us.  From  now  on,  we  work 
under  the  assumption  that  (n  +  1  )MN  is  not  divisible  by  squares  of  large 
primes. 


3.5.  The  instance  when  n  or  m  has  few  large  digits  in  a  prime  base 
p  £  J  dividing  MN.  Now  we  assume  that  MN  is  divisible  by  some  prime 
p  G  J .  Then  p  divides  either  n+i  +  l  for  some  i  £  { 1,2,...,  /’}  or  2n  +  2j  —  1 
for  some  j  €  {1,2,...,/}.  The  situation  here  is  entirely  symmetric  so  we 
only  consider  the  case  when  p  divides  n  +  i  +  l  for  some  i  G  {1,2,...,/}. 
Let 

n  =  n0px  +  nipA_1  4 - b  n\ 
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be  the  base  p  representation  of  n.  Observe  that  n\  =  p  —  i  —  1  is  fixed 
for  large  x  (namely  for  x  so  large  that  y  >  K  +  1),  and  so  it  is  enough  to 
investigate  the  number 

n'  =  nQpx  l  H - 1-  tia- u 


Similarly,  we  write 

m  =  rn-op7  +  ?nip7-1  +  •  •  •  +  m1 


for  the  base  p  representation  of  m  =  n  +  k.  Its  last  digit  is  k  —  i  —  1  if 
i  <  k  —  1  and  p  —  1  if  i  =  k,  so  it  is  enough  to  investigate  the  number 

ml  =  uiqp^~  1  +  •  •  •  +  m1- 

We  now  let  M^\x)  be  that  subset  of  n  G  M.^k\x)\M.^\x)  for  which 


either  s  =  #{1  <  j  <  A  —  1  :  nj  >  p/2}  <  A/4, 

or  t  =  #{1  <  j  <  7  —  1  :  rrij  >  p/2}  <  y/4. 

The  situation  is  entirely  symmetric  when  dealing  with  the  digits  of  n  in 
base  p,  and  with  the  digits  of  m  in  base  p,  so  we  deal  only  with  the  number  s. 
Fix  the  positive  integer  s  <  A/4.  The  indices  {ii, . .  ^  fis}  C  {1, . . . ,  A  —  1} 
for  which  rij  >  p/2  can  be  fixed  in  (^Z1)  <  (^)  ways,  and  summing  up  the 
number  of  such  choices  over  s  <  A/4,  we  get  that  the  total  number  of  such 
choices  is  at  most 


Here,  we  used  the  inequality  i\  >  {l/ey  valid  for  positive  integers  £,  together 
with  the  fact  that 


|_7tJ !  >  (N/e)M  =  (u/e)^  (l  +  0(l/u))M  >  {u/e)^ 

valid  for  all  real  numbers  it  >  1  (in  (24)  we  took  u  =  A/4). 

There  are  (p  —  l)/2  possible  digits  larger  than  p/2  and  (p  +  l)/2  pos¬ 
sible  digits  smaller  than  p/2.  Once  the  subset  of  indices  in 

{1,2,..., A  —  1}  has  been  chosen,  the  number  of  choices  for  the  digits 
{ni, . . . ,  riA-i}  such  that  n*.  >  p/2  for  j  =  1, . . . ,  s  and  the  remaining  A—  1— s 
digits  are  smaller  than  p/2  is  therefore 


(25) 


A— 1— s 

< 


V 


A— 1 


2A-i 


Since  certainly  A  =  O(logx),  whereas  p  >  y,  we  get  that 


Thus,  the  number  of  possibilities  for  the  number 

n"  =  nipx~2  H - b  tia- l 
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is,  after  multiplying  bounds  (24)  and  (25)  and  using  estimate  (26),  of  order 
at  most 

„A-1  \^A-l 


(27) 


A(4e 


iA/4  P_ 


< 


A  px 


2A  1  (4/e)A/4 


Say  n"  is  fixed  in  one  of  the  above  ways.  Since  x/2  <  n  <  x,  we  have  that 

x/2-ip-i-l )  ,  A-i,  //^  x-(p-i-l) 

-  <  n  =  nop  +  n  <  — 


P 


P 


so  that 


x/2  -  (p-i-l)  „  x-{p-i-l)  „ 

- n  <  nop  < - n  . 


P 


P 


The  number  of  multiples  of  px  1  in  the  interval 


1  = 


x/2  —  (p  —  i  —  1) 


—  n 


x  —  (p  —  i  —  1) 


—  n 


P 


p 


is  at  most 
(28) 


x 


2  p> 


3x 

+  1  <  — v, 
“  2px 


because  px  <  n  <  x.  In  the  above,  we  also  used  the  fact  that  the  length  of  X 
is  x/(2 p).  Thus,  the  number  of  ways  of  choosing  no  is  of  order  at  most  x/px. 
In  conclusion,  the  number  of  choices  for  n  is,  after  multiplying  bounds  (27) 
and  (28),  of  order  at  most 


x  A  px  1 

px  X  (4/e)A/4 


x’A 

^  p( 4/e)A/4 


Observe  that  since  x/2  <  n  <  x,  it  follows  that  x/(2p)  <  px  <  x,  so  that 
A  =  logx/ logp  +  0(1).  Since  p  <  z,  we  get  that  A  >  C3  log  log  x  +  0(1). 
This  is  a  lower  bound  on  A,  while  certainly  A  <  logx  is  an  upper  bound 
for  A.  Thus,  putting  C4  =  (03/4)  log(4/e),  we  get  that  the  number  of  choices 
for  such  n  is  of  order  at  most 

xlogx 

p(logx)C4 


The  same  inequality  applies  to  the  cardinality  of  the  subset  consisting  of 
those  n  6  A4W(x)\A4^(x)  for  which  t  <  7/4.  Further,  all  this  was  for  a 
fixed  i  (or  j)  in  {1,2,...,  A;}.  Summing  up  over  all  possible  values  of  i  (or  j ), 
and  then  over  all  the  possible  primes  p  G  77,  we  get  that 


#M(2k\x)  « 


x(log  X)K  1 

(fogx)c4  2_^p 


x(log  logx)2 
(logx)C4_1 


where  we  used  the  fact  that  the  estimate 


E 

p<u 


1 

P 


log  logn  +  0(1) 
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holds  for  all  u  >  10  (see  Theorem  427  in  [10]).  Similarly,  as  before,  we  put 
M2{x)  =  Uo<|fc|</\  (x)  and  get  that 


(29)  #M2(x)  < 


E 

0<\k\<K 


#M?\x)  « 


a;  (log  logx)0 
(logx)C4-1 


=  o 


(log  x) 


as  x  — >  oo,  provided  that  C4  >  5;  that  is,  C3  >  20/log(4/e),  which  we  are 
assuming.  In  fact,  we  take  C3  =  52.  The  bound  (29)  is  acceptable  for  us. 


3.6.  The  case  when  Tm^n  is  divisible  by  some  large  prime:  Set 

Up.  In  this  and  the  next  section,  we  suppose  that  there  exists  a  large  prime 
p  dividing  Tm^n-  We  start  by  noticing  that  this  is  always  the  case  when 
p  |  MN  for  some  prime  p  £  J  and  n  0  M^\x)  U  A4^\x).  Let  us  justify 
this  observation.  For  a  prime  q  and  a  positive  integer  u,  we  put  uq{u)  for  the 
exponent  of  q  in  the  factorization  of  u.  By  Rummer’s  theory  relating  the 
number  of  digits  of  n  and  m  in  base  p  which  exceed  p/2  with  the  exponent 
of  p  in  (2™)  and  (2/)  (see  [7]),  we  get,  from  the  fact  that  n  0  M.^\x)  that 

min  |  up  >  up  (("^))}  -  V4  >  13  log  log  x  +  0(1). 

Since  p2  does  not  divide  either  n  +  lorm  +  l  =  n  +  fc  +  l  (this  is  because 
n  0  M^\x)),  it  follows  that  for  large  x  we  have 

(30)  min  {vp  (Cn) ,  vp  (Cm)}  >  A/4  >  13  log  log  x  +  0(1)  >  2. 

Since  MN  is  not  divisible  by  squares  of  primes  in  J  (again  because  we  have 
n  0  A4^),  h  follows  by  inequality  (30)  and  the  STM IV-equation  (22),  that 
T m  and  Tv  are  divisible  by  all  the  prime  factors  of  M  and  N ,  respectively, 
which  belong  to  J .  In  particular,  there  exists  a  large  prime  dividing  T m^n, 
which  is  what  we  wanted. 


3.7.  The  case  when  TmTat  is  divisible  by  some  large  prime:  Sieves. 

Assume,  for  example,  that  pap\\TM  for  some  large  prime  p,  where  ap  >  1. 
From  (22),  we  read  that  p  \  cf>(MN).  The  same  conclusion  holds,  namely 
that  p  |  (j>(MN ),  when  pap  ||r jv-  Since  p2  \  MN  (because  n  0  M^\x)),  we 
conclude  that  there  exist  i  (or  j )  in  {1,2,...,  k}  such  that  p  \  n  +  i  +  1  (or 
p  |  2n  +  2j  —  1),  and  also  i\  (or  j\)  in  the  same  set  {1,2,...,  k}  and  a  prime 
q  congruent  to  1  modulo  p  dividing  n  +  i\  +  1  (or  2n  +  2ji  —  1).  In  all  cases, 
we  get  that  n  is  in  a  certain  arithmetic  progression  modulo  p  and  in  another 
arithmetic  progression  modulo  q,  so  by  the  Chinese  Remainder  Lemma,  n 
is  in  a  fixed  arithmetic  progression  modulo  pq.  The  number  of  such  n  <  x 
is  <  x/(pq)  +  1. 

We  consider  first  the  case  when  pq  <  lOx.  Then 


x 

pq 


+  1  < 


11s 
pq  ' 
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Keeping  i  (or  j )  fixed  and  i\  (or  ji)  fixed,  and  summing  first  over  all  the 
primes  q  <  x  +  2 K  —  1  <  3x  with  q  =  1  (mod  p),  then  over  all  p  £  J ,  we 
get  a  bound  of 


1  1  ^  x(loglogx)2 

-  <C  x  log  log  x  >  — r  <C - 

p  ^  q  <  p2  y 

p&J  q=  1  (mod  p)  p&J 

q<  Sx 


<31>  11 E!  E 


on  the  number  of  such  possibilities  n.  In  the  above,  we  used  the  Brun- 
Titchmarsh  bound 


(32) 


E 


q= 1  (mod  d) 
q<u 


1  log  log  u 
q  (f>{d) 


valid  for  all  real  numbers  u  >  10  and  all  positive  integers  d  (see  Lemma  1 
in  [2]). 

Summing  up  the  above  bound  (31)  over  all  pairs  i  (or  j )  and  i\  (or  j i), 
we  get  a  bound  of  order 


(33) 


x(loglog  x)2K~  .x(loglogx) 
y  ^  (logx)10 


on  the  number  of  such  possibilities  for  n.  To  organize  ideas,  we  write 
JA^\x)  for  the  set  of  positive  integers  under  scrutiny.  Recall  that  this 
set  is  the  set  of  n  e  .A/f^(x)\  (A4f^(x)  (J  for  which  there  exists 

a  large  prime  p  \  M N  and  q  \  MN  such  that  p  \  q  —  1  and  such  that 
furthermore  pq  <  10x.  Bound  (33)  gives 

x(log  logx)4 


#Mf\x)  « 
<(k). 


(logx 


1 10 


Putting  M3(x)  =  Uo<|fc|<A'-A/i3  (x)>  we  §et  that 
(34)  #M3(z)<  E 

0<|fc|<A' 

as  x  — >  oo.  The  bound  (34)  is  acceptable  for  us. 
We  now  take  a  look  at  the  case  when  pq  >  lOx. 


=  o 


x 

(logx)3 


Case  1.  The  case  when  p  \  N. 

Say  2n  +  2j  —  1  =  0  (mod  p)  for  some  j  G  {1,2,...,  k}.  We  now  write 
2n+2j  — 1  =  pa.  Suppose  first  that  n+i\  +  l  =  qb  for  some  i\  G  {1,  2, . . . ,  k}. 
Then  2 qb  =  (2 n  +  2 j  —  1)  +  (2ii  +  3  —  2 j).  Reducing  the  above  equation 
modulo  p ,  we  get  that  26  =  2ii  +  3  —  2 j  (mod  p).  However,  observe  that 
b  <  (n  +  K  +  l)/q  <  2x/q  <  p/4  for  large  x  (because  pq  >  lOx),  so  that 

1 26  -  (2*i  +  3  -  2j)\  <  26  +  (2 K  +  1)  <  |  +  2K  +  1  <  p 

for  large  x,  because  p  >  y  >  4 K  +  2.  Since  26  —  (2 i\  +  3  —  2 j)  is  a  multiple  of 
p  smaller  than  p  in  absolute  value,  it  should  be  the  number  zero,  but  this  is 
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impossible  because  it  is  an  odd  number.  A  similar  argument  deals  with  the 
case  when  2 n  +  2j\  —  1  =  qb  for  some  j\  €{1,2,...,  k}.  In  this  case,  j\  /  j, 
for  otherwise  we  would  get  that  p  \  b,  therefore  pq  <  2n  +  2 1\  —  1  <  3x 
for  large  x,  which  contradicts  the  fact  that  pq  >  lOx.  Further,  we  have 
b  <  (2 n  +  2 K  —  1  )/q  <  3 x/q  <  p/3  for  large  x  (again,  because  pq  >  10a;). 
Thus, 

qb  =  (2 n  +  2 j  -  1)  +  2(ji  -  j). 

Reducing  the  above  equation  modulo  p  we  get  b  =  2(ji  —  j)  (mod  p).  How¬ 
ever,  the  inequality 

I  b-  2(ji  -  j)  |  <|  +  2 K  <P 

holds  for  large  x,  and  since  b  —  2(ji  —  j )  is  a  multiple  of  p,  it  should  be 
the  number  zero,  which  is  again  impossible  since  this  number  is  in  fact  odd, 
because  b  is  odd.  This  takes  care  of  the  case  when  p  divides  N. 

Case  2.  The  case  when  p  \  M. 

Assume  that  p  \  n+i  + 1  for  some  *€{1,2,...,  k}.  As  in  Case  1,  we  write 
n  +  i  +  1  =  pa.  Suppose  first  that  q  \  n  +  i\  +  1  for  some  i\  €  {1,  2, . . . ,  k} 
and  write  n  +  i\  +  1  =  qb.  If  i  =  i\.  then  p  \  b,  so  pq  \  n  +  i  +  1,  so 
pq  <  x  +  K  +  1  <  2x  for  large  x,  contradicting  the  fact  that  pq  >  10a;. 
Thus,  i  /  i\.  Clearly,  b  <  (n  +  K  +  T)/q  <  2 x/q  <  p/ 4  for  large  x.  Then 
qb  =  n  +  i  +  1  +  (*i  —  i),  and  reducing  the  above  relation  modulo  p  we  get 
that  b  =  i\  —  i  (mod  p).  However,  for  large  x  we  have 

\b  —  (*i  —  *)|  <  b  +  K  <  ^  +  K  <  p. 

Thus,  the  number  b  —  (i\—  i )  is  zero,  showing  that  b  =  i\  —  i.  In  particular, 
*i  >  i,  and  we  get  the  equation 

(35)  n  +  i\  +  1  =  q(h  -  i). 

Another  possible  case  is  when  q  \  2n  +  2j\  —  1  for  some  j\  €  {1,  2, . . . ,  k}.  In 
this  case,  2n  +  2j\  —  l  —  qb,  so,  as  before,  b  <  (2n  +  2K  —  1  )/q  <  3x/q  <  p/3 
for  large  x.  Further,  qb  =  2n  +  2j\  —  1  =  (2 n  +  2i  +  2)  +  (2j\  —  2i  —  3). 
Reducing  the  above  relation  modulo  p,  we  get  b  =  2j\  —  2i  —  3  (mod  p) . 
Since  the  inequality 

| b  -  (2ji  -  2i  -  3)|  <  b  +  2K  +  1  <  -  +  2K  +  1  <  p 

3 

holds  for  large  x,  we  must  have  b  =  2j\  —  2i  —  3.  In  particular,  j\  >  i  +  2 
and 

(36)  2  n  +  2j\  —  1  =  q(2j\  —  2i  —  3). 

So  far,  we  learned  that  if  there  is  a  large  prime  factor  p  of  F mF at  and 
n  €  Mtyk\x)\(Mi(x)\JM2(x)\JMz{x)),  then  p  \  M,  so  p  |  n  +  i  +  1  for 
some  i  €  {1,2 and  either  relation  (35)  holds  with  some  i±  >  i  in 
{1,2,...,  A;}  and  some  prime  q,  or  relation  (36)  holds  for  some  ji  >  i  +  2  in 
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{1,2,...,  A:}  and  some  prime  q.  In  both  cases,  k  >  2  (in  fact,  in  the  second 
case  we  must  have  k  >  3).  Consider  the  forms 

2n  +  1  and  2  n  +  3. 


Rewriting  them  in  terms  of  the  prime  q,  they  become 
(37) 

2(ii  —  i)q  —  (2-ii  +  1)  and  2{i\  —  i)q  —  (2i\  —  1)  if  (35); 

(2ji  -  2i  -  3)q  -  2(jd  -  1)  and  (2jd  -  2i  -  3)q  -  2(ji  ~  2)  if  (36). 


Since  *i  >  *  >  1  (so  >  2),  and  j\  >  i  +  2  (so  ji  >  3),  in  both  cases, 
we  obtain  two  non-proportional  linear  forms  in  the  prime  q.  Also,  none 
of  the  two  forms  is  proportional  to  q  itself  (since  the  constant  coefficients 
are  not  zero).  Observe  that  2n  +  1  and  2n  +  3  are  free  of  primes  in  J7, 
otherwise,  we  are  already  in  the  case  when  n  G  A4-^'\x)  by  the  deduction 
from  Subsection  3.6,  and  Case  1  of  Subsection  3.7.  Hence,  we  have  two  non¬ 
proportional  linear  forms  with  nonzero  coefficients  in  the  prime  q  which  are 
free  of  primes  from  J .  It  follows  from  the  sieve  (see  Theorem  5.8  in  [8]), 
that  the  number  of  possibilities  for  q  <2x  (hence,  for  n  <  x)  is  of  order  at 
most 

x  TT  (i  _  x  /logy  V2  x(loglogx)4 

logX  ^1  \  pj  log  X  \  log  zj  (logx)3 

In  the  above  application  of  the  sieve,  we  implicitly  used  the  fact  that  for 
large  x  we  have  y  >  2 K  +  1,  and,  in  particular,  the  two  pairs  of  linear 
forms  in  q  shown  at  (37)  are  nonproportional  modulo  p  for  all  primes  p  G  J . 
Of  course,  this  was  for  fixed  i  and  i i ,  or  i  and  j±.  Summing  over  all  the 
possibilities  for  i  and  i\  or  j\ ,  we  get  that  if  we  put  M.^\x)  for  the  subset 
of  n  under  consideration,  we  get  that 

nU(k)(  ,  „  x(loglogx)4/i2  x(log logx)6 

4  X  (logx)3  (logx)3 

Putting  Ma(x)  =  Uo<|fe|<Ar-^4fc^(a::),  we  get  that 


(38) 


#Ma{x)<  Y,  #Mfc)(*)« 

0<\k\<K 


x(log  logx)7 
(logx)3 


X 

(logx)3"1"0^)  ’ 


as  x  -A  oo.  The  bound  (38)  is  acceptable  for  us.  This  completes  the  analysis 
of  the  case  when  T ^,/T n  is  a  multiple  of  some  large  prime. 


3.8.  The  Case  when  1^/17^  is  free  of  large  primes  and  k>  2.  By 

the  results  from  Section  3.6,  it  follows  that  MN  is  free  of  primes  from  J . 
Assume  that  k  >2.  Then  each  of 

n  +  2,  n  +  3,  2n  +  l,  2n  +  3, 

are  free  of  primes  p  G  J .  These  four  linear  forms  in  n  are  non-proportional. 
Thus,  by  the  sieve  (see  Theorem  5.7  in  [8]),  the  number  of  such  n  <  x  is  of 
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order  at  most 


4\  /logz\4  x(loglogx)8 

p)  X  Vogy)  (log.x)4 


Hence,  putting  M.§(x)  for  the  set  of  such  n  <  x,  we  get  that 

1 1  *  j  /  \  x(loglogx)8  (  x 
(39)  #M5(x)  «  (logx)4  =  0  ((b^)3 

as  x  — >■  oo.  The  bound  (39)  is  acceptable  for  us. 


3.9.  The  Case  when  Tm~Pn  is  free  of  large  primes  and  k  =  1.  Going 
back  to  the  results  from  Subsection  3.2,  we  see  that  D  =  1,3  and  P(UV )  <  3. 
Now  equation  (22)  tells  us  that  the  relation 

Si<KM)  =  T±4>{N) 

holds  with  some  positive  integers  Si  and  T\  with  P(S\Ti)  <  y.  Replacing 
M  and  N  by  (n  +  2)/UD  and  (2n  +  1  )/VD,  respectively  (see  relation  (19)), 
we  get  that  the  relation 

(40)  520(n  +  2)  =  T2(j)(2n  +  1) 

holds  with  some  positive  integers  S-2  and  T2  with  P^T^)  <  y.  We  also  have 
the  additional  information  that  (n  +  2)(2n  +  1)  is  free  of  primes  from  J . 


3.10.  The  structure  of  solutions  to  equation  (40).  To  handle  such  pos¬ 
itive  integers  n,  we  recall  that  if  we  put 

(41)  T(t,  w)  =  {n  <  t  :  P(n)  <  ru}, 


then  the  inequality 


t 

exp(u/2) 


where 


log^ 

logic 


holds  for  all  2  <  w  <  t  (see  [22,  Theorem  1,  p.  359]).  Better  estimates  for 
are  known  (see  [4]),  but  we  shall  not  need  them.  Let  Me(x)  be 
the  set  of  n  G  M(x)\  IJ'JL,  Mi(x)  such  that  P(n)  <  y  or  P(2n  +  1)  <  y.  By 
estimate  (41),  it  follows  that 

#M6(x)  <  2 #T(x,y)  <  - — - - yr 

exp  ((log  x)/ (2  log  y)) 

x  \ 

(iogx)3y  ’ 

as  x  -A  00.  This  is  acceptable  for  us.  From  now  on,  we  work  with  the 
remaining  numbers  n  in  M(x). 

We  next  write 


exp  ((log  x) / (20  log  log  x)) 


=  o 


(43)  n  +  2  =  pi  •  •  •  pra  and  2n  +  1  =  qi  ■  ■  ■  qsb , 

where  r  >  1,  s  >  1,  z  <  pi  <  ■  ■  ■  <  pr ,  z  <  qi  <  ■  ■  ■  <  qs  are  primes  and 
P(ab )  <  y.  Such  representations  for  n  and  2n  +  1  exist  because  n  0  Mq(x), 
so  there  exist  prime  factors  of  both  n  +  2  and  2n  +  1  exceeding  y)  hence, 
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exceeding  z  because  n  +  2  and  2n  +  1  are  coprime  with  the  primes  from  J . 
Further,  n  +  2  and  2n+l  are  not  divisible  by  squares  of  large  primes  because 
n  0  Put  L  =  logy  =  10  log  log  x.  We  let  Mj(x)  be  the  set  of  n 

such  that  either  a  >  exp(L2),  or  b  >  exp (L2).  Fixing  a,  the  number  of  such 
n  <  x  is  at  most  (x  +  2 )/a  <  2 x/a,  while  fixing  b,  the  number  of  such  n  is 
at  most  ( 2x  +  1  )/b  <  3 x/b.  Thus, 

(44)  #m7(x)  <  ^2  ^  +  ^2  y-5x  E  “  :=  5xS- 

a>exp(L2)  b>exp(L2)  a>exp(L2) 

P(a)<y  P(b)<y  P(a)<y 


Putting 

A(t)  =  {exp (L2)  <  a  <  t  :  P(a)  <  y}, 
we  get,  by  estimate  (41)  and  the  fact  that  for  t  >  exp(L2)  we  have 

log  t  .  L2 


> 


log  y  log  y 


=  10  log  log  x, 


that 

(45) 

#A(t)  _  #'P(t,  y)  eXp  ((l0g  f)/(2  log  y)  —  exp (5  log  log  ,x)  (log.x)5’ 


By  Abel’s  summation  formula  and  estimate  (45)  together  with  the  observa¬ 
tion  that  max{a,  b}  <  3x,  we  get 


cS  = 


(46) 


E  1 

a£A(3x) 


r3x 


d#A(t)  <  f#A(t)\t=3* 


V  t 


t=L 


#A(3x) 

<  ZWP — 1  + 


1 


f3x  dt 

—  < 


1 


x  (log  x)5  JL  t.  ""  (logx)4' 
Inserting  estimate  (46)  into  estimate  (44),  we  get 

(47)  #M7(X)  «  ^^4- 


r*3a; 


+ 


#A{t)dt 

t2 


This  is  acceptable  for  us.  From  now  on,  assume  that  maxja,  b]  <  exp (L2). 
Observe  next  that  since  pi  >  y  and  34  >  y,  and  for  large  x,  we  have  that 
z10L  =  x100/52  >  2x  +  1,  it  follows  that  max{r,  s}  <  10L.  Now  write 

(48) 

Pi  —  1  =  Aidi ,  where  p(Ai)  >  y  and  P(a,;)  <  y,  for  all  i  =  1, . . . ,  r. 

Similarly,  we  write 

(49) 

qj  —  1  =  Bjbj,  where  p(Bj)  >  y  and  P{bj)  <  y,  for  all  j  =  1, . . . ,  s. 

We  let  M.$(x)  to  be  the  subset  of  n  G  M(x)\  (J^=i  -Me(x)  such  that  either 
the  inequality  a*  >  exp(L2)  holds  for  some  i  =  1  or  the  inequality 

bj  >  exp (L2)  holds  for  some  j  =  l,...,s.  Assume  that  a*  >  exp(L2)  for 
some  i  =  1, . . . ,  r.  Then  there  exists  a  prime  p  dividing  n  +  2  and  a  divisor 
a  of  p  —  1  with  a  >  exp (L2)  and  P(a)  <  y.  Fixing  such  a  positive  integer 
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a  and  then  the  prime  p  =  1  (mod  a),  the  number  of  such  n  <  re  is  at  most 
(x  +  2)/p  <  2 x/p.  Summing  first  over  all  p  =  1  (mod  a),  then  over  all  the 
suitable  values  for  a  <  2x,  we  get  that  the  contribution  of  such  n  <  x  is  of 
order  at  most 

E  E  1  «  £  log  log  X  £ 

a>exp(L2)p=l  (mod  a)  a>exp(L2) 

P{a)<y  P<2x  P(a)<y 

(50)  <x(loglogx)2  ^2  ^ 

a>exp(L2) 

P(a)<y 

n  i  \2  c  x(loglogx)2 
=  x  (log  logx)  5  «  (lQgx)4  • 

In  the  above  inequalities  we  used  a  variety  of  inequalities  such  as  the  Brun- 
Titchmarsh  inequality  (32)  to  estimate  the  inner  sum  over  the  reciprocals  of 
the  primes  p  <  x  congruent  to  1  modulo  a,  the  minimal  order  of  the  Euler 
function  (13)  to  deduce  that  1  /(f>(a)  <C  (loglogx)/a  for  all  a  <  2x,  as  well  as 
the  estimate  (46).  The  case  when  bj  >  exp (L2)  holds  for  some  j  =  1, . . . ,  s 
is  analogous.  Namely,  in  this  case  we  get  that  there  exists  b  >  exp (L2)  with 
P(b )  <  y  dividing  q  —  1  for  some  prime  factor  q  of  2n  +  1.  Fixing  b  and  q, 
the  number  of  such  n  <  x  is  at  most  (2x  +  1  )/q  <  3 x/q.  Summing  up  the 
above  bound  over  all  q  =  1  (mod  b)  with  q  <  3x  and  then  over  all  b  <  3x 
with  b  >  exp (L2)  and  P(b)  <  y.  we  get  an  estimate  of  the  same  order  as 
(50).  Hence, 


(51) 


#Ms(x)  < 


x(log  logx)2 
(logx)4 


This  is  acceptable  for  us. 

We  now  work  with  the  remaining  set  of  n  £  A4(x).  Equation  (40)  implies 
that 


(52)  nA=n^ 

i= 1  j= 1 


Put  A4g(x)  for  the  set  of  such  n  with  min{r,  s}  >  2.  Let  n  be  such  a 
number.  We  certainly  know  that  the  primes  p\, . . .  ,pr,  q\, . . . ,  qs  are  all 
distinct.  Assume  that  pr  >  qs,  since  the  remaining  case  can  be  handled 
similarly.  Then  equation  (52)  shows  that  there  exists  j  G  {1,2,...,  s}  such 
that 


Dj=gcd(A1,Bj)>A11/s 


Pi  ~  1 
ai 


l/s 


> 


z  -  1 
exp(L2) 


i/(io  L) 


>  x 


1/L3 


where  the  last  inequality  holds  for  all  sufficiently  large  values  of  x.  Observe 
that  Dj  |  pi  —  1  and  Dj  \  qj  —  1.  Further,  the  congruences  n+2  =  0  (mod  p\) 
and  2n  +  2  =  0  (mod  qj)  put  n  <  x  into  a  certain  arithmetic  progression 
modulo  piqj  by  the  Chinese  Remainder  Lemma.  Since  r  >  2  and  pr  >  qs, 
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we  have  that  p\q3  <  p\qs  <  pipr  <  n  +  2  <  2x  (in  case  qs  >  pr,  the  last 
member  of  the  corresponding  inequality  is  q\qs  <  2n  +  1  <  3x,  which  is 
good  enough  for  the  purposes  of  the  subsequent  argument).  The  number  of 
n  <  x  in  the  above  arithmetic  progression  modulo  p\q3  is  of  order  x/(p\qj). 
We  now  vary  p\  and  qj  through  the  set  of  all  primes  not  exceeding  3x  and 
which  are  congruent  to  1  modulo  d,  while  keeping  d  =  Dj  fixed,  and  then 
over  all  d  >  x1^3 ,  getting  a  contribution  of  order 


E  £ 

d>x1/L3  Pl<qj<3x 

pi=qj=l  (mod  d) 


2x 

PiQj 


«x  J2 

d>x1/L 3 


/  \ 


E 

p<  3x 

\p=l  (mod  d) 


i 

p 


2 


(53) 


<c  x  (log  log  x) 2 

d>X1/L3 


1 


<c  x(iogiogx)4  yy 

d>a;1/1'3 


1 


x(log  logx)2 
Xl/L3 

A  similar  argument  applies  to  the  case  when  qs  >  pr,  and  the  number  of 
such  n  is  of  the  same  order  as  shown  in  (53)  above.  We  thus  get  that 


(54) 


#A49(x)  < 


x(log  logx) 
rl/L3 


=  o 


(log  x) 


as  x  — >  oo.  This  is  acceptable  for  us. 

So,  from  now  on,  we  assume  that  r  =  1  or  s  =  1.  We  show  that  r  =  1 
implies  s  =  1.  The  reciprocal  is  also  true  and  the  details  are  similar.  For  r  = 
1,  we  get  that  A\  =  B\  ■  ■  ■  Bs.  Using  this  relation  together  with  equations 
(48)  and  (49)  into  (43),  we  get 


(55)  2{A\a\  +  l)a  -  3  =  2(n  +  2)  -  3  =  2n  +  1  =  (Bib i  +  !)■■■  ( Bsbs  +  1)6, 


leading  to 

idd  ewo  u  i,  ,  mi  /  n  ,  o  ,  2sB1B2---Bsb1---bsb 

\B\B‘2  ■  ■  ■  Bs(2a\a  -  bib2  ■  ■  ■  bsb)\  <  2a  +  3  H - — — —  • 

mm{  Bt  :  1  <  i  <  s} 

Dividing  across  by  B\  ■  ■  ■  Bs  and  using  the  bound  s  <  10 L  together  with  the 
bound  max{a,  6, 6i, . . . ,  bs}  <  exp(L2),  and  assuming  that  j  €  {1,2,...,  s} 
is  such  that  Bj  is  minimal,  we  get  that 


|2aia 


b\b2  ■  ■  ■  6S6|  < 

< 


< 


2a  +  3  +  2s6i  •  •  •  bsb  5a2sbi  ■  ■  ■  bsb 
Bj  ~  ( Pj  -  l)/bj 

10  x  2s  exp  (L2(s  +  3)) 

Pi 

10 exp  (L2(10L  +  3)  +  10L)  i 

z 
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where  the  last  inequality  holds  provided  that  x  is  sufficiently  large.  Thus, 
for  large  x  we  have  2aai  =  b\  ■  ■  ■  bsb.  Using  this  information  in  equation 

(55) ,  we  get 

(56)  2a  —  3  =  (B\b\  +  1)  •  •  •  ( Bsbs  +  1)6  —  B\  ■  ■  ■  Bsb\  ■  ■  ■  bsb. 

The  right-hand  side  above  is  positive  so  a  >  2.  If  s  >  2,  in  the  right-hand 
side  above  we  have  a  sum  of  2s  —  1  terms  one  of  them  being 

B!  •  •  •  5s_iM2  •  •  •  bs-ib  >BX  = 

b\  26i 

Comparing  this  with  (56),  we  get  that 

z  <  pi  <  Aab\  <  4exp(2L2), 

which  is  false  for  large  x.  Hence,  s  =  1.  As  we  have  already  said,  a  similar 
argument  shows  that  s  =  1  implies  that  r  =  1. 

So,  from  now  on,  we  have  r  =  s  =  1,  so  A\  =  B\,  and  2aai  =  b\b.  Then 
equation  (43)  is 

2(a\A\  +  l)a  —  3  =  {B\bi  +  1)6, 

which  together  with  the  fact  that  2aa\Ai  =  B\b\  yields  2a  —  3  =  6. 


3.11.  Bounding  #Mr(x).  Since  n  +  2  =  pa  and  2n  +  1  =  q(2a  —  3),  we  get 
that 

(2  a)p  —  (2a  —  3  )q  =  3. 

Clearly,  the  greatest  common  divisor  between  a  and  2a  —  3  is  1  or  3  according 
to  whether  a  is  coprime  to  3  or  not.  Further,  the  smallest  positive  integer 
solution  (u,  v)  of  equation  2 au  —  (2a  —  3)v  =  3  is  (u,v)  =  (1, 1).  Hence,  we 
get  that 


1  +  (2a  —  3)A; 
1  +  2aA, 


if  3  fa; 


1  +  (2a/3  -  1)A; 
1  +  (2a/3)A, 


if  3  I  a. 


Since  ap  =  n  +  2  <  2x,  it  follows  that  p  <  2 x/a,  so  that  A  <  2x/ (a(2a  —  3)) 
if  3  f  a,  whereas  A  <  6x/(a(2a  —  3))  if  3  |  a.  In  both  cases,  we  have  that 
A  <  12 x/a2,  because  2a  —  3  >  a/2  for  all  a  >  2.  Thus,  fixing  a  <  exp (L2), 
we  have  that  A  <  12 x/a2.  Further,  A  has  the  property  that  a  pair  of  non¬ 
proportional  linear  forms  in  A  are  both  primes.  By  the  sieve  (see  Theorem 
5.7  in  [8]),  the  number  of  such  A  <  12.x/a2  is  of  order  at  most 

E 

W) 


(57) 


a2  log (x/a2) 


where  we  can  take  E  =  2a(2a  —  3).  Since  E  <  4a2,  by  inequality  (13),  we 
deduce  that  the  estimate  E/cj)(E )  <C  loglog(a  +  1)  holds  for  all  a  >  2.  Since 
a2  <  exp(2L2)  <  x1/2  for  large  x,  we  get  that  the  expression  (57)  is  of  order 


x 


^  (log  log(a  +  l))2  ^ 


(logx)2 
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Summing  up  the  last  bound  above  for  all  possible  values  of  a,  it  follows  that 
the  remaining  set  of  n  <  x,  call  it  has  cardinality  satisfying  the 

inequality 


(58)  #Mi0(x)  < 


X  \  -\ 

(logx)2 

V  '  2<a<exp(L2) 


(log  log(a  +  1))S 


< 


X 


(logx) 


3.12.  The  values  of  r.  To  conclude  the  proof  of  Theorem  2.1  it  suffices  to 
show  that  the  numbers  n  G  M\q{x)  lead  to  a  solution  (m,  n)  =  (n  +  1,  n)  of 
equation  4>{Cm) / (j>(Cn)  =  r  with  r  =  4.  We  go  back  through  the  argument 
from  Section  3.2  keeping  track  of  all  the  parameters.  We  have 


Cm.  —  C'n+l 


2(2  n  +  1) 
n  +  2 


Cn 


2bqr 

ap 


so  that 


apCn+ 1  =  2  bqCn. 

Let  d  =  gcd(a,  26)  =  gcd(a,4a  —  66).  Clearly,  d  G  {1,  2,  3,  6}.  Put  ai 
and  6i  =  2b/ d.  We  then  have 


a/d 


aipCn+i  =  biqCn. 


Thus,  there  exists  some  positive  integer  C  such  that 

(59)  CVi+i  =  6i  qC  and  Cn  =  a±pC. 

Observe  that  n  =  ap— 2  =  (a—l)p+(p—2),  and  a— 1  <  exp(L2)  <  z/2  <p/ 2 
for  large  x,  so  that  up  ( (2”))  =  1.  Since  p  \  n  +  2,  we  have  that  p\n  +  1,  so 
we  get  that  p\\Cn.  This  shows  that  p  \  C.  Further,  n  =  (6—  l)/2q+(q—  l)/2, 
and  (6  —  l)/2  <  exp(L2)  <  z/2  <  q/2  for  large  x,  therefore  vq  ((2^))  =  0. 
Hence,  q\Cn,  showing  that  q  \  C . 

We  now  study  the  exponents  of  the  small  primes  in  (2”) .  Let  p  <  y  be 
any  small  prime.  Write,  as  in  Section  3.5, 

n  =  n0px  +  nipA“1  H - h  np 

for  the  base  p  representation  of  n.  Let  us  count  the  number  of  n  <  x  such 
that  for  some  p  <  y,  we  have  that 

s  =  #{1  <  i  <  A  :  rii  <  p/2}  <  L3. 

There  are  A  +  1  possible  locations  {0, 1, . . . ,  A}.  There  are 

(60)  ^  ^  <  +  1)l3  <  exp(L4)  for  large  x 

s<L3  x  S  ' 

possibilities  of  choosing  subsets  {ii, . . . ,  zs}  with  at  most  L3  elements,  where 
the  digits  smaller  than  p/2  are  located.  Once  these  positions  are  chosen, 
the  number  of  possibilities  of  actually  assigning  digits  rij  <  p/2  whenever 
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j  e  {*i, . . . ,  is},  and  n3  >  p/ 2  whenever  j  G  {0, 1, ,  A}\{ii, . . . ,  is},  is  at 
most 


(61) 


A+l— s 


1\  A+1 


2A 


PJ 


x 

L3^’ 


and  this  estimate  is  uniform  in  2  <  p  <  y.  Note  that 

logx 


(62)  A=!^+0(1)>^+0(1)  = 


logy 


logy 


10  log  log  x 


+  0{  1)> 


logx 


4 L  log(1.3) 


for  large  x  uniformly  in  p  <  y. 

Multiplying  bounds  (60)  and  (61)  and  using  the  lower  bound  (62)  on  A, 
we  get  that  the  number  of  such  n  <  x  is,  for  a  given  p,  at  most 

xyexp(L4)  xyexp(L4) 

1.3logz/(4Llog(1.3))  a;l/(4L) 

Summing  the  above  bound  over  all  primes  y  <  y,  we  get  a  bound  of 

xy2 exp(L4) 

X V 0-0 

Putting  A4n(x)  for  the  set  of  such  n  <  x,  we  get 


(63) 


#Mu(x)  < 


xy2  exp(L4 

xVOO 


=  o 


(logx) 


as  x  — >  oo.  This  is  acceptable  for  us. 

From  now  on,  we  work  with  the  remaining  numbers  n.  For  them,  the 
inequality  t,p((2;”))  >  L 3  holds  for  all  small  primes  p.  If  p  divides  n  +  2,  then 
p  divides  a.  Since  a  <  exp(L2),  it  follows  that  vp{a)  <  (L2)/( log 2)  <  2 L2. 
Hence,  vp{Cn)  >  L3  —  2L2.  Otherwise,  that  is  if  p  does  not  divide  n  +  2, 
then  vp(Cn )  =  vp{ (  ”) )  >  L3.  Hence,  at  any  rate,  if  p  divides  a\b\ ,  then 

//p(C'n)  >  L3  -  2L2  >  2L2  >  i/p(ai6i). 


Going  back  to  equation  (59),  it  follows  that  all  primes  dividing  ai&i  divide 
in  fact  C.  Thus,  write 

c  =  rairfclr, 

where 

rai  =  n  pap  and  rH  =  n  pap- 

fP»\\C  p^WC 

p\a\  p\bi 

Equations  (59)  become 


cn+i  =  5iyrair6lr 


and  Cn  =  aipTaiTblT. 


Taking  the  Euler  functions,  we  get 


cp(Cn+i)  =  0(6i)  (g- l)0(rai  )Fbl  0(r);  4(Cn)  = 

Put  rad(fe)  =  Ylp\kP  f°r  radical  of  k.  Using 


<t>{a  i)  —  °i 


<^(rad(ai)) 

rad(ai) 


and 


^(Fai)  =  rai 


0(rad(ai)) 
rad(ai)  ’ 
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the  similar  relations  with  a\  replaced  by  b\ ,  and  the  fact  that 


b{q-l)  =  2  a(p-  1), 


therefore 


q~  1 

p  —  1 


we  get  that 


2a 

T 


4 (a/d) 
26/d 


4ai 

"&T’ 


#gn+l) 

0(Cn) 


1  rad(fei) 
^(rad(ai)) 
0,1  rad(ai) 


X 


r01 


i/i(rad(ai) 

rad(ai) 


r 


ai 


p  0(rad(6i) 
1  bl  rad(fei) 


=  4. 


The  case  r  =  1/4  comes  from  the  case  when  m  <  n  (in  particular,  when 
m  =  n  —  1).  We  now  conclude  that  the  conclusions  of  Theorem  2.1  hold. 
Indeed,  estimates  (23),  (29),  (34),  (38),  (39),  (42),  (47),  (51)  and  (54)  show 
that  the  cardinalities  of  A ij  ( x )  for  j  =  1 , . . . ,  9  are  bounded  as  shown  in 
(10),  while  if  n  has  made  it  to  Alio(.'c),  whose  cardinality  is  bounded  as  in 
(58),  but  r  /  4,  1/4,  then  it  must  be  the  case  that  in  fact  n  €  A4n(x),  a  set 
whose  cardinality  is  bounded,  from  inequality  (63),  by  the  right-hand  side 
of  (10).  Thus,  r  =  4,  1/4  remain  the  only  options  for  n  €  A4io(x)\A4n(x), 
and  this  is  bounded  as  shown  in  (11)  by  estimate  (58). 


4.  Open  questions 

Numerically,  it  seems  that  {( t>(Cn)}n>2  is  an  increasing  sequence.  We 
leave  this  as  a  research  problem  for  the  reader.  It  would  be  interesting  to 
study  the  Carmichael  A-function  of  the  Catalan  numbers.  We  conjecture 
that  for  all  7c  >  1,  there  are  infinitely  many  positive  integers  such  that 

KCn+l)  =  A  (Cn+2)  =  ■■■  =  A  (Cn+k). 
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5.  Appendix 

We  display  in  Figure  1  a  graph  showing  the  number  of  solutions  of  the 
equation  4  =  cf>(Cn+i) / 4>(Cn)  in  the  range  n  €  [0,  3000]. 
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